Search results for " Clustering"

showing 10 items of 312 documents

Penalized regression and clustering in high-dimensional data

The main goal of this Thesis is to describe numerous statistical techniques that deal with high-dimensional genomic data. The Thesis begins with a review of the literature on penalized regression models, with particular attention to least absolute shrinkage and selection operator (LASSO) or L1-penalty methods. L1 logistic/multinomial regression models are used for variable selection and discriminant analysis with a binary/categorical response variable. The Thesis discusses and compares several methods that are commonly utilized in genetics, and introduces new strategies to select markers according to their informative content and to discriminate clusters by offering reduced panels for popul…

High-dimensional dataQuantile regression coefficients modelingTuning parameter selectionGenomic dataLasso regressionLasso regression; High-dimensional data; Genomic data; Tuning parameter selection; Quantile regression coefficients modeling; Curves clustering;Settore SECS-S/01 - StatisticaCurves clustering
researchProduct

A 3-D marker-free system for the analysis of movement disabilities--an application to the legs.

2001

The aim of this paper is to describe an approach allowing the analysis of human motion in three-dimensional (3-D) space. The system that we developed is composed of three charge-coupled-device cameras that capture synchronized image sequences of a human body in motion without the use of markers. Characteristic points belonging to the boundaries of the body in motion are first extracted from the initial images. Two-dimensional superquadrics are then adjusted on these points by a fuzzy clustering process. After that, the position of a 3-D model based on a set of articulated superquadrics, each of them describing a part of the human body, is reconstructed. An optical flow process allows the pr…

Motion analysisLegFuzzy clusteringFuzzy classificationComputer sciencebusiness.industryMovementOptical flowGeneral MedicineGaitMotion (physics)Computer Science ApplicationsGait (human)Motion fieldFuzzy LogicGait analysisSuperquadricsHumansComputer visionDisabled PersonsArtificial intelligenceElectrical and Electronic EngineeringbusinessBiotechnologyIEEE transactions on information technology in biomedicine : a publication of the IEEE Engineering in Medicine and Biology Society
researchProduct

Bayesian Markov switching models for the early detection of influenza epidemics

2008

The early detection of outbreaks of diseases is one of the most challenging objectives of epidemiological surveillance systems. In this paper, a Markov switching model is introduced to determine the epidemic and non-epidemic periods from influenza surveillance data: the process of differenced incidence rates is modelled either with a first-order autoregressive process or with a Gaussian white-noise process depending on whether the system is in an epidemic or in a non-epidemic phase. The transition between phases of the disease is modelled as a Markovian process. Bayesian inference is carried out on the former model to detect influenza epidemics at the very moment of their onset. Moreover, t…

Statistics and ProbabilityEpidemiologyComputer scienceBayesian probabilityMarkov processBayesian inferenceDisease Outbreakssymbols.namesakeBayes' theoremStatisticsInfluenza HumanEconometricsHumansHidden Markov modelModels StatisticalMarkov chainIncidenceBayes TheoremMarkov ChainsMoment (mathematics)Autoregressive modelSpainSpace-Time ClusteringsymbolsRegression AnalysisSentinel Surveillance
researchProduct

An evolutionary restricted neighborhood search clustering approach for PPI networks

2014

Protein-protein interaction networks have been broadly studied in the last few years, in order to understand the behavior of proteins inside the cell. Proteins interacting with each other often share common biological functions or they participate in the same biological process. Thus, discovering protein complexes made of a group of proteins strictly related can be useful to predict protein functions. Clustering techniques have been widely employed to detect significant biological complexes. In this paper, we integrate one of the most popular network clustering techniques, namely the Restricted Neighborhood Search Clustering (RNSC), with evolutionary computation. The two cost functions intr…

Computer sciencebusiness.industryCognitive NeuroscienceNeighborhood searchComputational biologyPPI networks clusteringGenetic algorithmsMachine learningcomputer.software_genreBudding yeastEvolutionary computationComputer Science ApplicationsOrder (biology)Artificial IntelligenceGenetic algorithmArtificial intelligenceEvolutionary approachesbusinessCluster analysiscomputerProtein-protein interaction networks clustering
researchProduct

Statistical power of disease cluster and clustering tests for rare diseases: A simulation study of point sources

2012

Abstract Two recent epidemiological studies on clustering of childhood leukemia showed different results on the statistical power of disease cluster and clustering tests, possibly an effect of spatial data aggregation. Eight different leukemia cluster scenarios were simulated using individual addresses of all 1,009,332 children living in Denmark in 2006. For each scenario, a number of point sources were defined with an increased risk ratio at centroid, decreasing linearly to 1.0 at the edge; aggregation levels were administrative units of Danish municipalities and squares of 5, 12.5 and 25 km 2 . Six statistical methods were compared. Generally, statistical power decreased with increasing s…

MaleAdolescentEpidemiologyDenmarkHealth Toxicology and MutagenesisGeography Planning and DevelopmentDisease clustercomputer.software_genreStatistical powerStatisticsCluster AnalysisHumansComputer SimulationPoint (geometry)Poisson DistributionChildCluster analysisSpatial analysisk-medians clusteringMathematicsStatistical hypothesis testingSpatial AnalysisLeukemiaModels StatisticalData CollectionIncidenceCentroidInfectious DiseasesChild PreschoolFemaleData miningcomputerSpatial and Spatio-temporal Epidemiology
researchProduct

Tensor clustering on outer-product of coefficient and component matrices of independent component analysis for reliable functional magnetic resonance…

2019

Background. Stability of spatial components is frequently used as a post-hoc selection criteria for choosing the dimensionality of an independent component analysis (ICA) of functional magnetic resonance imaging (fMRI) data. Although the stability of the ICA temporal courses differs from that of spatial components, temporal stability has not been considered during dimensionality decisions. New method. The current study aims to (1) develop an algorithm to incorporate temporal course stability into dimensionality selection and (2) test the impact of temporal course on the stability of the ICA decomposition of fMRI data via tensor clustering. Resting state fMRI data were analyzed with two popu…

model ordertoiminnallinen magneettikuvaustensor clusteringfMRIsignaalianalyysistabilityindependent component analysis (ICA)
researchProduct

ProcMiner: Advancing Process Analysis and Management

2007

This paper contributes both to research and practice on process mining. Previous research on process mining has focused on mining patterns from event log files to generate process models. The process mining approach adopted in this paper is focused on producing patterns about process models, not the models themselves. The approach is demonstrated by ProcMiner -an explorative research prototype for management, consolidating, publishing, retrieving, and analyzing process models. Content-based document clustering is applied to process models represented as XML database in order to find topical groups from models. In practice, organizations face numerous challenges in managing their process mod…

klusterointiProcess modelingprosessitiedon analysontiEvent (computing)Computer sciencecomputer.internet_protocolprocess miningProcess miningDocument clusteringXMLcomputer.software_genreData scienceprosessijohtaminendocument clusteringConsistency (database systems)XML databaseQuality management systemprosessien hallintacomputerXML
researchProduct

An efficient cluster-based outdoor user positioning using LTE and WLAN signal strengths

2015

In this paper we propose a novel cluster-based RF fingerprinting method for outdoor user-equipment (UE) positioning using both LTE and WLAN signals. It uses a simple cost effective agglomerative hierarchical clustering with Davies-Bouldin criterion to select the optimal cluster number. The positioning method does not require training signature formation prior to UE position estimation phase. It is capable of reducing the search space for clustering operation by using LTE cell-ID searching criteria. This enables the method to estimate UE positioning in short time with less computational expense. To validate the cluster-based positioning real-time field measurements were collected using readi…

ta113SIMPLE (military communications protocol)business.industryComputer scienceReal-time computingLTE cell-IDFingerprint recognitionGridminimization of drive testsDetermining the number of clusters in a data setEmbedded systemgrid-based RF fingerprintingRadio frequencybusinessCluster analysishierarchical clustering
researchProduct

Computational cluster validation for microarray data analysis: experimental assessment of Clest, Consensus Clustering, Figure of Merit, Gap Statistic…

2008

Abstract Background Inferring cluster structure in microarray datasets is a fundamental task for the so-called -omic sciences. It is also a fundamental question in Statistics, Data Analysis and Classification, in particular with regard to the prediction of the number of clusters in a dataset, usually established via internal validation measures. Despite the wealth of internal measures available in the literature, new ones have been recently proposed, some of them specifically for microarray data. Results We consider five such measures: Clest, Consensus (Consensus Clustering), FOM (Figure of Merit), Gap (Gap Statistics) and ME (Model Explorer), in addition to the classic WCSS (Within Cluster…

clustering microarray dataMicroarrayComputer scienceStatistics as Topiccomputer.software_genrelcsh:Computer applications to medicine. Medical informaticsBiochemistryStructural BiologyDatabases GeneticConsensus clusteringStatisticsCluster (physics)AnimalsCluster AnalysisHumansCluster analysislcsh:QH301-705.5Molecular BiologyOligonucleotide Array Sequence AnalysisStructure (mathematical logic)Microarray analysis techniquesApplied MathematicsComputational BiologyComputer Science ApplicationsBenchmarkingComputingMethodologies_PATTERNRECOGNITIONlcsh:Biology (General)Gene chip analysislcsh:R858-859.7Data miningDNA microarraycomputerAlgorithmsSoftwareResearch ArticleBMC Bioinformatics
researchProduct

Functional linear models for the analysis of similarity of waveforms

2018

In seismology methods based on waveform similarity analysis are adopted to identify sequences of events characterized by similar fault mechanism and prop- agation pattern. Seismic waves can be considered as spatially interdependent three dimensional curves depending on time and the waveform similarity analysis can be configured as a functional clustering approach, on the basis of which the member- ship is assessed by the shape of the temporal patterns. For providing qualitative ex- traction of the most important information from the recorded signals we propose an integration of the metadata, related to the waves, as explicative variables of a func- tional linear models. The temporal pattern…

structured functional principal componentwaveforms clusteringfunctional data depthSettore SECS-S/01 - Statistica
researchProduct